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EXHAUSTIVE SELECTION OF RNA APTAMERS 
AGAINST COMPLEX TARGETS 

This application claims priority benefit of U.S. Provisional Patent 
5 Application Serial No. 60/391,255 filed June 24, 2002, which is hereby incorporated 
by reference in its entirety. 

This invention was made in part with government support under US 
Public Health Service Grant GM40918. The U.S. government may have certain rights 
to this invention. 

10 

FIELD OF THE INVENTION 

The present invention generally relates to the fields of molecular 
biology and genetics, and is more particularly directed to the exhaustive isolation of 
1 5 novel RNA ligands (aptamers) to complex molecular or super-molecular targets. The 
processes have applications in both basic biological research and therapeutic 
intervention, especially in the areas of drug discovery and functional genomics. 

BACKGROUND OF THE INVENTION 

20 

Over the past decade, numerous methods that combine the power of 
combinatorial chemistry and high-throughput screening have been developed to 
generate novel ligands that are critical to both basic research and therapeutic 
applications. Among these methods, the adaptive molecular evolution techniques with 

25 RNA, also known as SELEX (Tuerk and Gold, "Systematic Evolution of Ligands by 
Exponential Enrichment: RNA Ligands to Bacteriophage T4 DNA Polymerase," 
Science 249:505-510 (1990)), is particularly powerful thanks to the unique features of 
the RNA molecules, which not only carry information for their own replication but 
also fold into well-defined shapes. The novel ligands, or aptamers (Ellington and 

30 Szostak, "In vitro Selection of RNA Molecules that Bind Specific Ligands," Nature 
346:818-822 (1990)), generated by this SELEX process are capable of binding to a 
wide variety of targets with high affinity and specificity (Gold et al, "Diversity of 
Oligonucleotide Functions," Ann. Rev. Biochem. 64:763-797 (1995); Wilson and 
Szostak, "In vitro Selection of Functional Nucleic Acids," Ann. Rev. Biochem. 

35 68:611-647(1999)). 
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There has been a growing need in the drug discovery and functional 
genomics fields to develop methods that would yield aptamers against multiple targets 
in a single selection experiment. Experiments have been performed against multi- 
subunit enzymes (Brown and Gold, "Template Recognition by an RNA-dependent 
5 RNA Polymerase: Identification and Characterization of Two RNA Binding Sites on 
Qp Replicase," Biochemistry 34:14765-14774 (1995); Brown and Gold, "RNA 
Replication by QP Replicase: a Working Model," Proc. Natl Acad. Set USA 
93:11558-11562 (1996)), viral particles (Pan et al, "Isolation of Virus-neutralizing 
RNAs from a Large Pool of Random Sequences," Proc. Natl Acad. Set USA 

10 92 : 1 1 509- 1 1 5 1 3 (1 995)), organelles (Ringquist et al, "High-affinity RNA Ligands to 
Escherichia coli Ribosomes and Ribosomal Protein SI : Comparison of Natural and 
Unnatural Binding Sites," Biochemistry 34:3640-3648 (1995)), and entire cells 
(Morris et al, "High Affinity Ligands from in vitro Selection: Complex Targets," 
Proc. Natl Acad. Sci. USA 95:2902-2907 (1998)), with different degrees of success. 

15 In some cases, different families of ligands were identified for different targets in the 
mixture. However, these aptamers recognize the most abundant or easily recognizable 
target sites, which are not necessarily the most desired ones. No existing literature, i.e. 
none of the experiments performed so far and none of the currently available methods, 
offers an approach which is capable of generating different ligands to all of the targets 

20 in a mixture. 

The most important step in the SELEX procedure is the partitioning 
step in which the target-binding species is physically separated from the non-binding 
species. The immobilization matrix used as partitioning device can act as an unwanted 
target to generate unwanted aptamers that often dominate the selected pool. Several 

25 methods are commonly in use to avoid this problem (Conrad et al, " In vitro Selection 
of Nucleic Acid Aptamers that Bind Proteins," Methods in Enzymology 267:336-367 
(1996)). First, matrix-binding species may be eliminated by negative selection against 
the matrix, i.e., collecting the fraction not bound to the matrix. However, in early 
cycles, when the copy number of each clone is low, this extra handling may increase 

30 the chance of stochastic events in which a particular sequence is lost. In later rounds, 
when the matrix bound species becomes the dominant sequence population in a pool, 
this method may not be efficient enough to eliminate them. Second, since the number 
of binding site on the surface of the matrix is fixed, increasing the amount of the 
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target may change the ratio of the matrix-binding species to target-binding aptamers 
in favor of the recovery of the target-binding aptamers in the partitioning step. But the 
number of binding sites on the matrix can be extremely large compared to that on the 
target even at its highest level, thus rendering this method ineffective. In addition, 
5 higher target amounts will also favor the isolation of ligands with lower affinity, thus 
decreasing the efficiency of selection. Third, alternating use of different types of 
matrices should theoretically eliminate molecules binding to either. This method is 
less effective than might be expected because the difference between the commonly 
used matrices is not sufficient to discriminate against common aptamers that can bind 
10 via less specific hydrophobic interactions. Because of the lack of an efficient and 

specific negative selection method, the background problem is still a major reason of 
failure during in vitro selection experiments. 

SUMMARY OF THE INVENTION 

15 

A first aspect of the present invention relates to a method of identifying 
RNA ligands which bind to a target molecule, the method including: treating a first 
pool of RNA ligands that bind more than one target under conditions effective to 
reduce the concentration or eliminate the presence of one or more predominate target- 

20 binding RNA ligands from the first pool of RNA ligands; amplifying the RNA ligands 
in the treated first pool, thereby forming a second pool of RNA ligands that is 
enriched in one or more non-predominate target-binding RNA ligands of the first pool 
but not the one or more predominate target-binding RNA ligands thereof; and 
identifying from the second pool one or more predominate target-binding RNA 

25 ligands that are present in the second pool at a higher concentration than other target- 
binding RNA ligands. 

A second aspect of the present invention relates to a method of 
reducing the concentration or eliminating the presence of unwanted target-binding 
species from a pool of RNA ligands, the method including: providing a pool of RNA 

30 ligands which includes both wanted and unwanted target-binding RNA ligands; 

identifying one or more unwanted target-binding RNA ligands; and treating the pool 
under conditions effective to reduce the concentration or eliminate the presence of the 
one or more unwanted target-binding RNA ligands from the pool of RNA ligands. 
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A third aspect of the present invention relates to an oligoDNA 
molecule that hybridizes to an RNA ligand, which binds to a partitioning matrix, and 
is capable of directing an RNaseH enzyme to cleave a hybrid complex formed by the 
oligoDNA molecule and the RNA ligand. A duplex formed between an oligoDNA 
5 molecule of the present invention and an RNA ligand that binds to a partitioning 
matrix is also disclosed. 

A fourth aspect of the present invention relates to a kit for selecting 
RNA ligands which bind to one or more target molecules, the kit including: a matrix 
for partitioning RNA ligands that bind to one or more target molecules from RNA 
1 0 ligands which do not; an isolated oligoDNA molecule of the present invention; and an 
RNaseH enzyme. 

A fifth aspect of the present invention relates to a nucleic acid aptamer 
that binds to a heat shock factor protein. The nucleic acid can be either DNA or 
RNA. A related aspect of the present invention concerns a multivalent RNA aptamer 

1 5 that includes two or more aptamers that bind to a heat shock factor protein. 

A sixth aspect of the present invention relates to a method of 
modifying activity of a heat shock factor protein that includes: binding a nucleic acid 
aptamer of the present invention to a heat shock factor protein under conditions 
effective to modify the activity of the heat shock factor protein. A related aspect of 

20 the present invention concerns modifying the activity of the heat shock factor protein, 
which thereby modifies a stress response mediated by the heat shock factor protein. 

The present invention offers a general method that allows the 
exhaustive isolation of novel RNA ligands (aptamers) to multiple or complex, even 
inseparable, molecular or super-molecular targets. This is afforded by including 

25 within general iterative selection procedures a step that decreases or eliminates any of 
one or more aptamers or aptamer families whose sequence information are available. 
Predominant RNA ligands that are selected over other RNA ligands, once identified, 
can be decreased or eliminated from a pool of selected aptamers to allow other 
aptamers to dominate the subsequent generations of the pool and be identified. 

30 Unwanted partition matrix-binding species, such as nitrocellulose filter-binding RNA 
ligands, can likewise be decreased or eliminated from a pool of selected aptamers. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic diagram depicting the simplest case of 
exhaustive aptamer selection with two targets. It shows the relationship between the 
5 present invention and the SELEX method. Conventional SELEX yields only Aptamer 
A, which may or may not be desired. The present invention, of which the SELEX 
method is a specific example, allows both aptamer families to be identified. As the 
scheme depicted here can be used iteratively, the present invention also ensures the 
exhaustive isolation of aptamer families in experiments involving more than two 
10 targets. 

Figures 2A-B illustrates relationships between the RNA pools 
generated in one embodiment of the present invention. The ninth generation (G9) of 
three previous SELEX experiments involving the Drosophila B52 (Figure 2A) or HSF 
(Figure 2B) proteins as targets were used as starting materials (G9 B52 , G9 HSF/H , and 

15 G9 HSF/L ). These pools were treated by different combinations of Marking Oligos and 
RNaseH under indicated conditions. MGM is "Multi-G Motif that binds to the 
nitrocellulose filter used as the partitioning matrix. BBS is "B52 Binding Sequence" 
that binds to the intended target protein B52. The "Octamarker" set contains equal 
amount of SEQ ID Nos: 5-9 as specified in the Examples infra. The "Dodecamarker" 

20 set contains equal amount of SEQ ID Nos: 10-14 as specified in the Examples infra. 
"aBBS I & II" are oligonucleotides (SEQ ID Nos: 16 and 17) as specified in the 
Examples infra. Conditions of treatments are implicitly indicated in the name of the 
derivatized pools. For example, G9-M/M(H) were treated twice (-M/M) to eliminate 
the MGM family by the Hybridase. Those pools that are cloned and sequenced are 

25 indicated by boxes encasing their names. 

Figure 3 A is a graph that illustrates the efficiency of the RNA 
restriction treatment. The relative abundance of both BBS and MGM in pools treated 
by different combinations of Marking Oligos and RNaseH under different conditions 
is shown. The abundance of the aptamers in the pools (in the form of DNA) was 

30 determined by the densitometric data of Southern dot blot analysis using 

oligonucleotides with family consensus sequences as probes. The abundance of both 
aptamer families in the untreated G9 852 pool was set as 100. The actual ratio of BBS 
to MGM in this untreated pool is about 1:10. To assess the effect of each treatment, 
the relative abundance of a family before and after the treatment (the pairwise 
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connections below the graph) should be compared. Except for those indicated by 
"(E)", all other treatments were performed using the Hybridase. Figure 3B illustrates 
the effects of MGM restriction treatment, which resulted in a decrease in the size of 
the MGM-dominated G9 pool. The G9 pool and the unselected GO pool were treated 
5 with MGM marking oligos and RNaseH. "8" and "12" indicate the "Octamarkers" 
and "Dodecamarkers", respectively; "0" indicates controls without markers added. 
"E" and "H" indicate the E. coli RNaseH and the Hybridase, respectively. 

Figure 4 is a table that further illustrates the efficiency of MGM 
restriction in three different selected pools with different intended protein targets or 

1 0 selected under different conditions. The results were compiled from sequencing data. 
The abundance of MGM before and after the restriction is shown as percentage of 
MGMs in sample clones sequenced. The two numbers in the parentheses indicate 
actual number of MGM individuals / total number of sequenced individuals. 

Figures 5 A-B are graphs that illustrate the change in abundance of the 

15 MGM sequence family and the aptamer clone RA1-HSF, respectively, in the two- 
stage selection leading to the isolation of the HSF aptamer. The relative abundance of 
MGM in different generations was measured by Southern dot blot analysis using 
oligonucleotides representing the MGM consensus sequences as probes. The relative 
abundance of RA1-HSF in different generations was measured by Southern dot blot 

20 analysis using sequence of the randomized region of this clone as probe. Abundance 
is expressed in arbitrary units. Because of sequence variation among the family 
members and the resulting uneven hybridization efficiency, it is not possible to 
quantify precisely the abundance of a family, or of one family relative to the other, in 
one generation. However, the relative abundance detected by the same probe in 

25 different generations in the same experiment clearly showed the trends and patterns of 
enrichment. The affinity of RA1-HSF to HSF has been confirmed by electrophoretic 
mobility shift assay with different HSF constructs (see Figure 9A infra). Generations 
1 to 9 are those of Selection H. Generation 9a is G9/H-M/M/M. 

Figure 6 is a graph that tracks the enrichment of the MGM sequence 

30 family in three different selection experiments. The relative abundance of MGM in 
different generations was measured by Southern dot blot analysis using 
oligonucleotides representing the MGM consensus sequences as probes. The general 
shape of the curves was confirmed by examining the filter-binding activity of the 
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RNA pools representing each generation. Arrows indicate negative selection against 
filter-binding. 

Figure 7 is a graph that illustrates ion dependency of MGM binding to 
filters. The binding activity of MGM to the filter was examined in buffers only 
5 different in the monovalent ion they contain. Different RNA concentrations were used 
as indicated. The bulk binding activity of GO (the un-selected progenitor pool) and 
G9 B52 in buffer containing K + was compared to set the base line activity. The bulk 
binding activity of G9 B52 in buffer containing K + or Na + was compared to show the 
sensitivity of the binding activity to Na + . RNAs with three individual MGM 
10 sequences and one random sequence were used to further investigate the ion 
dependency. The sequences of these RNAs (I, II, III, and C in the graph) are as 
follows: 

MGM-I (SEQ ID No: 1) 

15 aacguagaac caauaagggu augggaaggg uaaaaggga 39 

MGM-II (SEQ ID No: 2) 

cacaacgauc aaaagaaaag ggagggccgg ggaaggguug gacaaacagg 50 
20 MGM-III (SEQ ID No: 3) 

gcccacgacc aaaacaaagg gaaggaggga gggugcagac gaaagccagg 50 
Ctr (SEQ ID No: 4) 

uacaacaucg uagcguggca acugauggcu uugccgaacu cugaa 4 5 

25 

Figures 8A-C illustrate the correlation between the aptamers' affinity 
to the target and their enrichment dynamics during the second stage of selection 
against the Drosophila HSF protein. Three clones were chosen to examine because 

30 they were isolated in multiple copies, an indication of their being selected by the 
targets. The DNA sequences encoding the randomized regions of these RNA 
aptamers, #14-1 (RA1-HSF), #14-2, and #9a-l are shown in Figure 8C. (The full 
length DNA sequence, from 5' to 3 f , includes nt 1-25 of SEQ ID NO: 31, the 
sequences shown in Figure 8C, and nt 66-90 of SEQ ID NO: 31.) Their relative 

35 abundance over time between generations G9a and G14 (Figure 8A) shows that #14-2 
and #9a-l were out-competed by #14-1 . Their fate can be explained by their relative 
affinity to HSF, as shown in Figure 8B, which is an electrophoretic mobility shift 
assay with lOOnM of either HSF or BSA (bovine serum albumin). 
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Figures 9A illustrates the affinity of the aptamer RA1-HSF (#14-1) to 
different construct of the protein. The left panel shows the result with three HSF 
fusion constructs prepared from bacteria. Numbers indicate concentration in 
nanomolar. #14-8 is a negative control. The right panel shows binding of this 
5 aptamer to HSF expressed from a baculovirus vector in the Sf9 insect cells (bvHSF). 
The lane labeled Sf9 contains crude cell extract from this cell line without the 
expression vector. A truncated version of HSF (bvHSF-deltaC) was also tested for 
aptamer binding. Figure 9B shows the results of deletion analyses performed to 
define the true aptamer moiety of RA1-HSF. The top bar above the sequence defines 
10 the 40 nt randomized region (central) and the 25 nt 5' constant region (left side) and 
the 25 nt 3' constant region (right side). 

DETAILED DESCRIPTION OF THE INVENTION 

1 5 One aspect of the present invention is a general method that allows the 

exhaustive isolation of novel RNA ligands (aptamers) to complex, even inseparable, 
molecular or super-molecular targets. The targets can be natural or synthetic small 
molecules, macromolecules and supramolecular assemblies, and combinations 
thereof. Exemplary targets can include, without limitation, nucleotides, peptides, 

20 proteins or polypeptides, protein-protein complexes, protein-nucleic acid complexes, 
viruses, cells, etc. 

Although SELEX was traditionally intended to generate aptamers 
against single targets, in a formal sense all SELEX experiments contain multiple 
targets. Mixtures that are used as targets in SELEX experiments come in the 

25 following common forms. First, the target can be a mix of multiple distinctive 

molecules, where aptamers against all target molecules are desirable. The targets may 
be separable, but a simultaneous selection is more economical. Second, a single target 
molecule can contain multiple discrete binding sites on its surface, where aptamers 
against all sites are desirable, but separation of the different target sites is physically 

30 impossible. Third, a presumably pure target preparation may contain an unknown 
contaminant, which may select unwanted aptamers that come to dominate the 
population. Finally, even the traditional single target selection requires a partitioning 
matrix (e.g. filters used to collect RNA-protein complexes) that can function as an 
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unwanted but largely inseparable target, which often selects undesired aptamers with 
the potential to overrun the population. 

In a multi-target selection experiment, the most important issue is to 
ensure that different ligands to all of the targets in a mixture can be generated. The 
5 present invention affords an exhaustive approach to achieve this objective. In SELEX 
or other selection procedures, the step of partitioning preferentially allows those 
molecular species that are fit to perform certain tasks (i.e., binding to the target) to 
reproduce. The change of the relative concentration of a certain sequence or sequence 
family under competition over successive generations is determined by its growth 

1 0 constant, which determines the rate of self-reproduction. If the growth constant of a 
certain species is greater than the average growth constant due to a higher recovery in 
partitioning, its relative concentration will increase; otherwise, it will decay. During 
the progression of selection, the average growth rate will increase steadily with cycles 
of selection and amplification until it reaches a maximum, when the relative 

1 5 concentration of the one sequence (or family of sequence) having the maximal growth 
rate increases to one, and the relative concentrations of all other sequences decay to 
zero. This "extremum principle" (Eigen, "New Concepts for Dealing with the 
Evolution of Nucleic Acids," Cold Spring Harbor Symposia on Quantitative Biology 
52:307-320 (1987), which is hereby incorporated by reference in its entirety) 

20 describes a behavior different from chemical equilibration. While chemical 

equilibration will yield definite concentration ratios determined by free energy of each 
species, selection eventually gives rise to only one survivor or a group of equivalently 
fitting survivors, regardless of whether the system as a whole grows or whether it 
reaches a steady state. 

25 This outcome contradicts the aforementioned desire to isolate different 

ligands to multiple targets in a mixture, as only one or a very few sequence families 
with high relative growth rates will have the opportunity to be enriched and identified. 
The difference between the growth constant of different ligand families during a 
selection against multiple targets is determined not only by the difference in target 

30 concentration and binding affinity, but also by the difference in other unidentified 
factors that determine the topography of the fitness landscape of different families. 
All these variables (save the relative target concentration in some cases) are unknown 
at the onset of the experiment. To ensure that aptamers against all potential target sites 
are isolated, the present invention allows for adjusting of the growth constant of any 
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aptamer clone or aptamer family based on their sequence information, without having 
to rely on the change of target concentration and affinity, and without interfering with 
the normal selection and amplification steps. 

Although the present invention is not intended to be limited to any one 
5 particular selection and amplification procedure, it is convenient to discuss the present 
invention as it is used in a SELEX experiment with RNA. It should therefore be 
appreciated by those of ordinary skill in the art that the present invention also can be 
used in combination with, e.g., in vitro selection for enzymatic activities, in vitro 
selection using DNA or modified oligonucleotides, or in vitro selection with peptides. 

1 0 An RNA aptamer for a protein or other kind of target usually mimics 

the shape of a natural ligand of the target. As the identity of a mimic is defined by the 
authentic, a mimic in a different chemical nature may not be as perfect as what it 
mimics. Although RNA is an extraordinarily versatile type of molecule, it cannot be 
guaranteed that an RNA ligand always exists for a binding site on a protein domain 

1 5 naturally occupied by a non-RNA molecule. On the other hand, multiple different 
RNA sequence/structure solutions may exist to fit some sites. Therefore, exhaustive 
selection of RNA aptamers in the sense of "one for each site" is not realistic if the 
sites are not defined by RNA. Nonetheless, the method described herein pursues 
exhaustiveness in aptamer identification in a practical sense: Every RNA ligand 

20 existing in the starting sequence pool should be isolated. Starting with a large diverse 
pool for every new stage may yield multiple aptamers for each binding site on the 
targets. 

In a SELEX experiment, genetic selection is applied directly to 
populations of RNA molecules that possess both genotypes (a sequence) and 

25 phenotypes (a binding activity that varies according to sequence). The conventional 
SELEX method attempts to recapitulate the natural Darwinian evolution process, in 
which the selection is based on phenotype (e.g. binding capability possessed by a 
folded RNA) and amplification is based on genotype (base paring during PCR). While 
the fitness of molecules (their ability to be enriched) may be also affected by their 

30 relative efficiency of enzyme-mediated replication, this is intentionally minimized, 
rather than explored, in the process of the experiment in order to keep the selection 
pressure solely on the phenotype. The nucleotide sequence of a nucleic acid molecule, 
as the physical embodiment of the information encoded therein, can be used in itself 
as the criterion for either positive or negative selection. This feature has been explored 
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in some molecular computation experiments to solve hard combinatorial optimization 
problems (Adleman, "Molecular Computation of Solutions to Combinatorial 
Problems," Science 266: 1 021 -1 024 (1 994), which is hereby incorporated by reference 
in its entirety). But when a sequence functions as the genotype of an organism, it is 
5 normally not accessible and subject to selection; and when it acts as the genotype of 
an aptamer in an ideal "single target selection", it is unknown until it is enriched 
according to its phenotype to the point of its identification. Once the sequence of an 
aptamer is identified in such an experiment, it loses its value of being a selection 
criterion for itself, since by then the selection has achieved its practical goal and is 

10 considered finished. The present invention provides a scheme of negative selection 
according to genotype, which utilizes the sequence information to reduce the relative 
size of particular aptamer populations during the process of selection against multiple 
targets. More specifically, it allows the resumption of selection/amplification to 
identify less abundant aptamers to other targets, once an aptamer family is identified 

1 5 due to its high growth rate. 

According to one aspect of the present invention, a method of 
identifying RNA ligands that bind to a target molecule is provided. This method is 
preferably performed as an exhaustive selection procedure to identify substantially all 
RNA ligands that appear in a pool of ligands and possess a particular phenotype (i.e., 

20 binding to a target). 

A pool of RNA ligands that collectively bind to more than one target is 
prepared by first generating a library of RNA molecules, which likely includes both 
RNA ligands that bind at least one of one or more targets and RNA molecules that do 
not bind any of the targets, and then partitioning the library of RNA molecules to 

25 form a first pool of such RNA ligands according to the conventional selection and 
amplification procedure (e.g., SELEX). Partitioning of the RNA library is achieved 
by physically separating the non-target binding RNA molecules from target-binding 
RNA ligands using a partitioning matrix as is known in the art. Suitable partitioning 
matrices include, without limitation, nitrocellulose, agarose beads, Biacore sensor 

30 chip, and any insoluble substance to which a target can be fixed or retained, or 

combinations thereof. Matrices can be selected based on pore size or binding affinity 
for target molecules (i.e., including target:RNA complexes). As noted hereinafter, the 
partitioning matrix may itself behave as a target, allowing recovery of RNA ligands 
that bind to the matrix rather than a desired target molecule. 
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The resulting first pool of target-binding RNA ligands includes RNA 
ligands that bind to at least one of the one or more targets used during the selection 
and partitioning procedure. Individual RNA ligands may or may not bind to more 
than one target; but collectively the pool of RNA ligands binds to more than one 
5 target. After forming the first pool of target-binding RNA ligands, the one or more 
target-binding RNA ligands which predominate within the first pool (specifically, 
within the sample therefrom) are isolated and sequenced using conventional (i.e., 
preferably automated) sequencing procedures. Having identified the one or more 
predominate target-binding RNA ligands, it is also possible to compare sequences to 

1 0 identify whether a family of sequences exist that possess a consensus sequence. 

Once the sequence of the one or more predominate target-binding 
RNA ligands has been determined, one or more oligonucleotides are designed 
specifically to hybridize to the one or more predominate target-binding RNA ligands 
or, alternatively, to the consensus sequence for a family of predominate target-binding 

1 5 RNA ligands. The oligonucleotides are preferably at least about 8 nucleotides in 
length, more preferably between about 10 and 100 nucleotides in length, most 
preferably between about 10 and about 40 nucleotides in length. It should be 
appreciated by those of ordinary skill in the art that longer or shorter oligonucleotides 
can be used; the oligonucleotides, however, must be specific for the predominate 

20 target-binding RNA ligands or the family thereof. 

The first pool of RNA ligands is then treated to reduce the 
concentration or eliminate the presence of one or more predominate target-binding 
RNA ligands from the first pool of RNA ligands. Specifically, oligonucleotides that 
hybridize with the predominate target-binding RNA ligands (or a family thereof) are 

25 introduced into the first pool under suitable hybridization conditions to cause the 
oligonucleotides to hybridize to the one or more predominate target-binding RNA 
ligands, forming hybrid complexes. The hybridization conditions will likely vary 
depending upon the sequences of the one or more predominate target-binding RNA 
ligands (or family thereof) and the oligonucleotide introduced into the first pool. 

30 Exemplary hybridization ponditions include, without limitation, 200ng RNA (6 
pmole) and 150 pmole marking oligonucleotides in 20jli1 RNaseH buffer (50 mM 
Tris-Cl, pH 7.4, 100 mM NaCl, 10 mM MgCl 2 ) with pre-incubation at 72°C for 3 
minutes and incubation at 37°C for 30 minutes. One skilled in the art will appreciate 
that conditions for nucleic acid hybridization, including temperature, salt, and the 
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presence of organic solvents, are variable depending upon the size (i.e., number of 
nucleotides) and the G-C content of the nucleic acids involved, as well as the 
hybridization assay employed ( See, e.g ., Sambrook et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Laboratory, Cold Spring Harbor, New York (1989); 
5 Nucleic Acid Hybridization: A Practical Approach, Haimes and Higgins, Eds., 
Oxford:IRL Press (1 988), each of which is hereby incorporated by reference in its 
entirety). 

After allowing a suitable period of time for hybridization to occur, a 
number of hybrid complexes (RNA ligand:oligonucleotide) will have formed within 

10 the treated first pool. To this mixture, an enzyme is added which recognizes the RNA 
hybrid complex and cleaves at least the RNA ligand of the hybrid complex or, 
alternatively, both strands of the hybrid complex. For example, an RNaseH enzyme is 
utilized in combination with oligoDNAs that hybridize to the predominate target- 
binding RNA ligands, cleaving the duplex-forming RNA ligands and thereby 

1 5 destroying the aptamer sequence of those RNA ligands. The oligoDNA is then 
released from the hybrid complex and is free to hybridize with other predominate 
target-binding RNA ligands (and thereby promote further RNaseH activity). For the 
size of hybridization described above by way of example (6 pmole RNA and 150 
pmole oligonucleotides), 1 unit of RNaseH is added during the annealing after the 

20 sample is at 37°C. Of course, more or less enzyme can be added depending upon the 
size of the hybridization procedure. Thus, following hybridization and enzyme 
treatment, the first pool has effectively been treated (forming a treated first pool) so as 
to eliminate or otherwise substantially reduce the population of what was previously 
the one or more predominate target-binding RNA ligand. 

25 Thereafter, the RNA ligands in the treated first pool are amplified 

using conventional reverse transcription amplification procedures, e.g., RT-PCR. As a 
result, a second pool of RNA ligands is formed. The second pool of RNA ligands is 
enriched in one or more non-predominate target-binding RNA ligands of the first pool 
but not the one or more predominate target-binding RNA ligands thereof. It is from 

30 this second pool of RNA ligands that one or more new predominate target-binding 
RNA ligands are identified. The one or more new predominate target-binding RNA 
ligands of the second pool are present at a higher concentration than other target- 
binding RNA ligands. Moreover, one or more new predominate target-binding RNA 
ligands of the second pool are different from the one or more predominate target- 
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binding RNA ligands from the first pool. In other words, because the one or more 
predominate target-binding RNA ligands of the first pool were substantially reduced 
in population or eliminated from the treated first pool, the same target-binding RNA 
ligands will not predominate during successive rounds. 
5 Subsequently, the one or more predominate target-binding RNA 

ligands of the second pool are identified as described above (i.e., isolated and 
sequenced) and then the second pool is treated in a manner similar to the above- 
described treatment of the first pool, thereby substantially reducing the concentration 
or eliminating the presence of one or more predominate target-binding RNA ligands 

10 of the second pool. Following amplification of the treated second pool, the RNA 

ligands therein are amplified as described above, forming a third pool that is enriched 
in one or more non-predominate target-binding RNA ligands of the second pool but 
not the one or more predominate target-binding RNA ligands thereof. 

This same process is repeated for the third pool and each subsequent 

1 5 pool prepared thereafter: namely, identifying predominate target-binding RNA 

ligands, treating the pool of RNA ligands to substantially reduce the concentration or 
eliminate the presence of the one or more predominate target-binding RNA ligands 
therein, and amplifying the treated pool. Eventually, after two or more rounds of this 
entire process, substantially all of the non-predominate target-binding RNA ligands of 

20 the first pool will have been identified. 

In addition to the foregoing, it may be desirable to remove certain 
RNA molecules from each pool in each round or in alternating rounds of conventional 
selection and amplification, depending on their population growth within successive 
pools of RNA ligands. Specifically, RNA ligands that bind to the partitioning matrix 

25 are undesirable and should periodically or systematically be removed from the 
population of RNA ligands which define each pool. To achieve this result, 
oligonucleotides known to hybridize with RNA ligands that bind to the partitioning 
matrix can simultaneously be introduced along with oligonucleotides that hybridize to 
the identified one or more predominate target-binding RNA ligands. The 

30 oligonucleotides preferably possess the same characteristics and properties as the 
oligonucleotides described above. 

From the foregoing description, it should be appreciated by those of 
skill in the art that a predominate target-binding RNA ligand, once identified, 
becomes (much like the RNA ligands that bind to the partitioning matrix) 
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operationally undesirable or unwanted. Because the sequence for such an RNA ligand 
is known, the sequence information is used to earmark the unwanted RNA ligands in 
the population and subject them to destruction using the treatments described above 
and demonstrated in the example hereinafter. Iterative rounds of this growth-rate 
5 reduction step will ensure that any ligand family, regardless of their initial relative 
growth rate, should have an opportunity to dominate the population and thus be 
identified in successive selection cycles. 

As an alternative to working with RNA ligands, it should be 
appreciated by those of ordinary skill in the art that DNA ligands that bind to at least 

1 0 one of one or more targets can be identified in substantially the same way. Exemplary 
approaches for in vitro selection of DNA ligands is described for example in Bock et 
al., "Selection of Single Stranded DNA Molecules That Bind and Inhibit Human 
Thrombin," Nature 355:564-66 (1992); and Huizenga et al., "A DNA Aptamer That 
Binds Adenosine and ATP," Biochemistry 34:656-65 (1995), each of which is hereby 

1 5 incorporated by reference in its entirety. Thus, the library of DNA molecules is 
exposed to one or more target molecules. Thereafter, either a partitioned or an 
unpartitioned library of DNA ligands is transcribed to form a library of RNA 
molecules as described (which itself is either partitioned or unpartitioned). The 
transcription can be achieved by using, e.g., PCR primers for transcription that 

20 include a T7 promoter. Once the library of RNA molecules is obtained, the library 
can be treated to substantially reduce the concentration or eliminate the presence of 
one or more predominate target-binding RNA ligands. The treated pool of RNA can 
then be converted back into DNA using any suitable reverse transcription procedure 
(i.e., RT-PCR). 

25 Regardless of the approach, once the exhaustive selection of RNA 

ligands has been completed or, rather, once any one individual RNA ligand has been 
obtained, the RNA ligands can then be used to develop multivalent RNA aptamers (as 
well as constructs encoding the same), as described in U.S. Patent No. 6,458,559 to 
Shi et al., which is hereby incorporated by reference in its entirety. 

30 For purposes of illustration, the simplest case of an exhaustive 

selection against two targets is shown in Figure 1 . This scheme can be extended to 
more complicated ones along two directions. First, the conventional SELEX and the 
method of the present invention do not have to be used in a chronologically discrete 
manner as depicted in Figure 1 . Rather, a population restriction step can be 
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incorporated into any or all SELEX cycles. Second, more than two targets, and in 
turn, more than two aptamer families can be included in a selection experiment. And, 
a single restriction step can treat more than one family. While sometimes more than 
one family of aptamers may be identified by the conventional SELEX, it only ensures 
5 the isolation of one aptamer family in a process governed by the "extremum 

principle" stated above. The present invention thus implements a "multiple extrema" 
process, in distinct steps, to ensure the exhaustive isolation of all aptamer families 
against a mixture of targets. Therefore, the present invention is not merely an 
improvement of the conventional SELEX method. Rather, it is a more general method 

10 that is applicable for use in conjunction with most any amplification and selection 

protocol. Use of the present invention with the conventional SELEX method, such as 
that described by US. Patent No. 5,270,163 to Gold et al., which is hereby 
incorporated by reference in its entirety, is only a single embodiment. 

Another aspect of the present invention is a general method that 

15 reduces the concentration or eliminates the presence of unwanted RNA ligands (i.e., 
previously identified RNA ligands, especially matrix-binding RNA ligands) which 
may otherwise thwart the isolation of desired aptamers against intended targets from a 
pool or population of RNA ligands. Basically, this procedure is carried out as 
described above, by identifying one or more unwanted target-binding RNA ligands 

20 present in a pool of RNA ligands, which includes both wanted and unwanted target- 
binding RNA ligands, and then treating the pool under conditions effective to reduce 
the concentration or eliminate the presence of the one or more unwanted target- 
binding RNA ligands from the pool of RNA ligands. Oligonucleotide and enzyme 
treatment for destruction of hybridization complexes is employed, as described above 

25 in the exhaustive selection procedure of the present invention. 

A selection cycle can be conceptually divided into three steps: 
selection, partitioning, and amplification (Irvine et al, "SELEXION: Systematic 
Evolution of Ligands by Exponential Enrichment with Integrated Optimization by 
Non-linear Analysis,' V. Mol Biol 222:739-761 (1991), which is hereby incorporated 

30 by reference in its entirety). This simplistic representation treats the selection step as 
an equilibrium binding process with the intended target(s). However, during selection 
the partitioning matrix, which is not an intended target and not necessarily at 
equilibrium with the aptamer candidates, is also exposed to the entire candidate pool. 
Binding of certain candidates to the matrix may occur, not necessarily (but 
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nonetheless possibly) during the designated binding step, but definitely during the 
partitioning step that follows it. As the candidates bound to the partitioning matrix are 
co-partitioned with those bound to the explicitly designated target(s), the matrix acts 
as a de-facto target in actual experiments. RNA species selected by their interaction 
5 with the partitioning matrix are usually not desired and regarded as "background". 
The large number of potential binding sites on the matrix present during the 
partitioning step and the high partitioning efficiency of species bound to the matrix 
often result in the dominance of the selected pools by such unwanted species. Without 
effective negative selection, this may severely thwart the isolation of aptamers against 

1 0 the intended targets. The sequence directed RNA restriction method presented here 
provides an effective approach to the solution of this problem. With this method, the 
negative selection can be performed against genotype in addition to phenotype, thus 
making the negative selections collectively more efficient and specific. 

Yet another aspect of the present invention is a method that 

1 5 specifically eliminates the unwanted filter-binding RNA generated when 

nitrocellulose filter is used as the partitioning device in a selection/amplification 
experiment. When the target in a selection experiment is a protein, a nitrocellulose 
filter is the popular partitioning matrix because it generally binds proteins and protein 
complexes but not nucleic acids (Y arus and Berg, "On the Properties and Utility of a 

20 Membrane Filter Assay in the Study of Isoleucyl-tRNA Synthetase," Analytical 

Biochemistry 35:450-465 (1970), and Yarus, "Adsorbent Filters: a New Technique for 
Microexperimentation on Nucleic Acid," Analytical Biochemistry 70:346-353 (1976), 
each of which is hereby incorporated by reference in its entirety). However, as shown 
in the accompanying Examples infra, this matrix can be a major reason behind the 

25 failure of selection experiments. The present invention provides specific reagents and 
schemes to solve this particular instance of the "background problem". First, when the 
filter-binding sequence family is identified, enzymatic treatment with RNaseH is the 
most effective method to get rid of those for which a marking oligonucleotide of 
sufficient length can be designed. Second, after removing the majority covered by a 

30 reasonably representative set of marking oligonucleotides, the remaining species can 
be more effectively eliminated by conventional negative selection against phenotype, 
i.e., filter binding. Third, since this family's affinity to the filter is dramatically 
compromised in the absence of potassium ion, alternative use of potassium and 
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sodium in the binding buffer is another choice during selection when physiological 
conditions need to be mimicked. 

A further aspect of the present invention relates to one or more kits 
which can be used for selecting RNA ligands that bind to one or more target 
5 molecules and, in particular, for selective removal of RNA ligands that bind to a 
partitioning matrix. The kits include a matrix for partitioning RNA ligands that bind 
to one or more target molecules from RNA ligands which do not, one or more 
oligoDNA molecules of the present invention (as described above), and a suitable 
enzyme (e.g., RNaseH) that cleaves a hybrid complex formed by the oligonucleotide 

1 0 molecule and RNA ligands that bind to the matrix. The kits can be used alone or in 
combination with other components useful in practicing the present invention. For 
example, the kits can also include reagents for amplifying RNA ligands (e.g., reverse 
transcriptase, DNA and/or RNA polymerases, dNTPs, rNTPs, etc.). 

As one embodiment of the present invention, a previously performed 

1 5 selection experiment was extended. The prior selection experiment yielded two 

families of RNA aptamers, one being dominant over the other, against two disparate 
targets: the intended target protein and the partitioning matrix. This previous work is 
described in U.S. Patent No. 6,458,559 to Shi et al., which is hereby incorporated by 
reference in its entirety. By applying the population restriction method of the present 

20 invention, the relative size of these aptamer sub-populations in the selected pool was 
adjusted. Since the previously dominant sub-population contained RNA species that 
bind to nitrocellulose filters, which is the most widely used partitioning matrix for 
protein targets, this also yielded a specific method and a group of specific reagents for 
decreasing this common background in these experiments. This method was tested on 

25 two other selection experiments involving a different target. 

In isolating RNA aptamers against the Drosophila splicing factor B52, 
the original pool was carried through nine cycles of selection and amplification with 
filter binding as the partitioning device (Shi et al, " A Specific RNA Hairpin Loop 
Structure Binds the RNA Recognition Motifs of the Drosophila SR Protein B52," 

30 Mol. Cell Biol 1 7: 1649-1657 (1997), which is hereby incorporated by reference in its 
entirety). No negative selection against the filter was performed until the last cycle, in 
which the recovered bound RNAs were passed through the same kind of filter before 
being amplified by RT-PCR. After cloning the pool of the ninth generation (G9), the 
binding capability of the individual clones to the B52 protein was tested in a band 
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shift assay. Among the 66 clones tested in one sampling, only 4 showed high-affinity 
specific binding. Their sequence contained a consensus that was later identified as the 
B52 Binding Sequence (BBS). Sequences of the clones that were not able to bind B52 
belong to a family whose members contain around four (usually 3-5) G-triplets. In 
5 most cases these G-triplets occur within a sequence segment 25-nt in length, and there 
is often an A preceding the G-triplet. Sequences meeting these criteria were termed 
"Multi-G Motif (MGM)". A filter-binding assay revealed dramatic increase of filter 
binding capability of the G9 pool over the unselected pool GO, and the same assay 
with individual MGM clones confirmed their contribution to the pool's increased 

10 average filter-binding affinity. These results indicated that two families of aptamers 
were selected by two different targets in this selection experiment. 

With the identified activities and corresponding consensus sequences 
of these two families, a detailed retrospective analysis was performed on the evolving 
RNA populations to study the relationship between the selected RNAs and the 

1 5 random sequence pool from which they arise (Shi et al, "Evolutionary Dynamics and 
Population Control During in vitro Selection and Amplification with Multiple 
Targets," RNA. 8:1461-1470 (2002), which is hereby incorporated by reference in its 
entirety). The data obtained in these studies on population dynamics indicate that this 
selection experiment is a valuable model system for SELEX against two or more co- 

20 existing targets. Moreover, BBS and MGM represent both extremes of the consensus 
aptamer sequence in terms of their length and degree of homology. Successful 
ablation of both from the pool, either successively or simultaneously, demonstrated 
the efficiency and versatility of the method presented here. 

Using the exhaustive selection procedure of the present invention to 

25 identify RNA aptamers of the Drosophila heat shock factor protein, three RNA 

aptamers were identified. The DNA molecules (i.e., reverse transcript) of the three 
RNA aptamers are the DNA molecules of SEQ ID Nos: 31 (14-1), 32 (14-2), and 33 
(9a- 1) as set forth below: 

30 14-1 

gggagaattc aactgccatc taggcatcgc gatacaaaat taagttgaac gcgagttctc 60 
catctagtac tacaagcttc tggactcgat 90 

14-2 



35 



gggagaattc aactgccatc taggcaagta gctaggagtc cttctcccct caaaacagaa 60 
tggggagtac tacaagcttc tggactcgat 90 
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9a-l 

gggagaattc aactgccatc taggcggcaa gctacgcgtc aaatagcaag cacaccgaag 60 
5 acacaagtac tacaagcttc tggactcgat 90 

The corresponding RNA aptamers encoded by the DNA molecules of SEQ ID Nos: 
31, 32, and 33 are set forth below as SEQ ID Nos: 34, 35, and 36, respectively: 

10 14-1 

gggagaauuc aacugccauc uaggcaucgc gauacaaaau uaaguugaac gcgaguucuc 60 
caucuaguac uacaagcuuc uggacucgau 90 

14-2 

15 gggagaauuc aacugccauc uaggcaagua gcuaggaguc cuucuccccu caaaacagaa 60 
uggggaguac uacaagcuuc uggacucgau 90 

9a-l 

20 gggagaauuc aacugccauc uaggcggcaa gcuacgcguc aaauagcaag cacaccgaag 60 
acacaaguac uacaagcuuc uggacucgau 90 

In the DNA molecules of SEQ ID Nos: 31-33 and the RNA molecules of SEQ ID 
Nos: 34-36, nt 1-25 and nt 66-90 represent conserved regions, whereas nt 26-65 in 

25 each of the above-identified sequences represents a variable region that was selected 
during the in vitro selection procedure performed in accordance with the present 
invention and described in Examples 3 and 6 infra. 

The target bound by the DNA molecules of SEQ ID Nos: 31-33 and 
the RNA molecules of SEQ ID Nos: 34-36 are heat shock factors. Heat shock factors 

30 regulate the activation of heat shock gene transcription by binding to specific 
promoter elements (Wu, "Heat Shock Transcription Factors: Structure and 
Regulation", Ann. Rev. Cell Dev. Biol. 1 1:441-469 (1995), which is hereby 
incorporated by reference in its entirety). Upon heat treatments, HSF forms 
homotrimers that bind to specific DNA sequences called heat shock elements (HSEs). 

35 The HSF senses and integrates signals from extracellular and intracellular 

environments and regulates the expression of heat shock genes whose products are 
chaperones that help proteins maintain stable folded conformations. Because heat 
shock factors are highly conserved proteins, the DNA ligands and RNA ligands that 
bind to the Drosophila heat shock factor proteins may also bind to vertebrate and, 

40 more specifically, mammalian heat shock factor proteins. 
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Thus, a further aspect of the present invention relates to a method of 
modifying activity of a heat shock factor protein and a method of modifying a stress 
response mediated by a heat shock factor protein. These methods include the step of 
binding the RNA aptamer of the present invention to a heat shock factor protein under 
5 conditions effective to modify the activity of the heat shock factor protein, thereby 
modifying a stress response mediated by the heat shock factor protein. In particular, 
by modifying the activity of the heat shock factor protein, it is possible to modify the 
level of transcription of genes under regulatory control of heat shock elements. 

10 EXAMPLES 

The following examples are provided to illustrate embodiment of the 
present invention. But they are by no means intended to limit its scope. 

15 Materials & Methods 

The materials and methods described below are applicable for all of 
the following examples. 

Oligonucleotides: 

20 The oligonucleotides used as "markers" in the RNA restriction 

treatment and probes in the Southern blot analysis are listed below. 

G6A2 (SEQ ID No: 5) 

cccttccc 8 



25 



A2G6 (SEQ ID No: 6) 

ccctccct 



A3G5 (SEQ ID No: 7) 

30 ccctcctt 



TAG6 (SEQ ID No: 8) 

cccctcca 



35 CTG6(SEQIDNo:9) 

cccacccg 



G6A2N4 (SEQ ID No: 10, where N=A+T+G+C) 

nncccttccc nn 12 



40 
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A2G6N4 (SEQ ID No: 1 1 , where N=A+T+G+C) 

nnccctccct nn 12 
A3G5N4 (SEQ ID No: 12, where N=A+T+G+C) 

5 nnccctcctt nn 12 

TAG6N4 (SEQ ID No: 13, where N=A+T+G+C) 

nncccctcca nn 12 

1 0 CTG6N4 (SEQ ID No: 1 4, where N=A+T+G+C) 

nncccacccg nn 12 

NCW13 (SEQ ID No: 15, where W= A+T and N=A+T+G+C) 

nncccwwwcc cnn 13 



15 



40 



Anti-BBS I (SEQ ID No: 16) 

cggtcgcctg gttgacc 17 



Anti-BBS II (SEQ ID No: 17) 

20 ctgtcgccag gttgatc 17 

ForT7#2 (SEQ ID No: 18) 

gtaatacgac tcactatagg gagaattcaa ctgccatcta 40 



25 The individual MGM species and a control RNA with random 

sequence were prepared by in vitro transcription (see below) from templates made 
from the following oligonucleotides: 

EDA45T7FOR (SEQ ID No: 19) 

30 gtaatacgac tcactatagg ctacaacatc gtagcgtgg 37 

EDA45COMP (SEQ ID No: 20) 

ttcagagttc ggcaaagcca tcagttgcca cgctacgatg ttgta 45 

35 MGM40T7FOR (SEQ ID No: 21) 

gtaatacgac tcactatagg caacgtagaa ccaataag 38 

MGM40COMP (SEQ ID No: 22) 

tcccttttac ccttcccata cccttattgg ttctacgttg 40 



MGM50IT7FOR (SEQ ID No: 23) 

gtaatacgac tcactatagg cacaacgatc aaaagaa 37 



MGM50ICOMP (SEQ ID No: 24) 

45 cctgtttgtc caacccttcc ccggccctcc cttttctttt gatcgttgtg 



50 
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MGM50IIT7FOR (SEQ ID No: 25) 

gtaatacgac tcactatagg gcccacgacc aaaacaa 37 
5 MGM50IICOMP (SEQ ID No: 26) 

cctggctttc gtctgcaccc tccctccttc cctttgtttt ggtcgtggg 4 9 

RA1-HSF, randomized region in the isolated clone (SEQ ID No: 27) 

atcgcgatac aaaattaagt tgaacgcgag ttctccatct 4 0 

10 

All oligonucleotides were synthesized at the 50 nmole scale by Operon Inc. 
(Alameda, CA). 

RNA Restriction Treatment: 

1 5 The RNA was prepared using the T7-MAGAshortscript™ in vitro 

transcription kit (Ambion, Inc., Austin, TX) according to the manufacturer's 
instructions. The E. coli RNaseH and the thermostable Hybridase™ were purchased 
from Epicentre Inc. (Madison, WI). In the RNaseH reaction, for each unit of RNaseH, 
200ng RNA (6pmol) and 150 (30 x5) pmol marking oligonucleotides were used in 

20 20|il RNaseH buffer (50 mM Tris-Cl, pH 7.4, 100 mM NaCl, 10 mM MgCl 2 ). The 
reaction mix was preincubated at 72°C for 3 minutes before adding the enzyme and 
transferring to a lower temperature specified by the type of RNaseH and the Tm of the 
marking oligo(s). When the E. coli RNaseH was used, the reaction was always held at 
37°C for 30 minutes and then stopped by addition of 10 mM EDTA. When the 

25 Hybridase™ was used, the reaction was held for one hour at a temperature 2-3 

degrees below the Tm of the marking oligo. For example, 45°C for the dodecamarkers 
(SEQ ID Nos: 10-14); 55 °C for the anti-BBS I and II (SEQ ID Nos: 16 and 17). 
When multiple oligonucleotides were included in a single reaction, the lowest Tm was 
used to determine this temperature. The treated pool can be amplified by RT-PCR and 

30 submitted to additional selection if needed. 

To examine the effect of this restriction treatment on the entire pool, a 
small amount (about 20-50 fmole) of radiolabeled tracer made from transcription with 
the same template preparation was included in the reaction, and a fraction of the 
reaction mix was then run on a 8% polyacrylamide, 7 M urea gel. The radiolabeled 

35 tracer was made with [a-32p] UTP (NEN Life Science Products, Boston, MA) using 
the T7-MAXIscript™ in vitro transcription kit (Ambion, Inc., Austin, TX) according 
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to the manufacturer's instructions. The radioactive bands on the gel were visualized 
by a phosphoimiger. The effect of this restriction treatment can also be assessed from 
the decrease of the relative abundance of the sequence family under restriction, using 
Southern dot blot analysis (see below). Finally, a representative sample of the pool 
5 can be cloned and sequenced to show the composition of a treated pool. The PCR 
products were directly cloned into the pSTBlue™-l blunt vector using the Perfect 
Blunt Cloning Kit from Novagen Inc. (Madison, WI) according to the manufacturer's 
instructions. DNA sequencing was performed using the Perkin Elmer/ Applied 
Biosystems Division Automated DNA Sequencer with BIG Dye™ Terminator 
10 chemistry and AmpliTaq-FS™ DNA polymerase by the DNA sequencing facility of 
the BioResource Center at Cornell University (Ithaca, NY). 

Southern Dot Blot Analysis: 

For each DNA pool representing a generation during the selection, 

1 5 about 20-40 ng (0.3-0.6 pmole) sample in 2jxl were prepared in lx SSC. Each DNA 
sample was denatured at 100°C for 10 min, chilled to 0°C quickly and kept at 0°C for 
more than 5 min before being applied onto a piece of Hybond-XL membrane 
(Amersham Pharmacia Biotech UK Ltd., Buckinghamshire, UK). The DNA was fixed 
to the membrane by baking at 80°C for 2 hours and UV cross-linking at 0.12 J/cm , 

20 254 nm. 

To prepare the probes, 20 pmole oligonucleotides (listed above) were 
labeled with T4 polynucleotide kinase (New England Biolabs, Inc., Beverly, MA) in 
20\i\ lx PNK buffer at the presence of 60 jiCi [y- 32 P] ATP (10,000 Ci/mmole, NEN 
Life Science Products, Boston, MA) according to manufacturer's instruction. 

25 Reactions were kept at 37°C for 1 hour, then stopped by adding 0.5 mM EDTA (pH 
8.0) to final concentration of 5 mM and heating to 65°C for 20 min. When detecting 
the constant region of the pool, 300 pmole cold probe (SEQ ID No: 18) was added to 
the[y- 32 P] labeled probe. 

The membrane was pre-hybridized for 30 min in 5x SSC/5x 

30 Denhardt's solution/0.5% SDS, before the probe was added. Hybridization was 

carried at 37°C overnight (i.e., from about 12 to about 1 8 hours). The membrane was 
washed three times, each in 0.1 x SSC/0.2% SDS at room temperature for 10 min. The 
signals on the membrane were collected by exposing to a phosphoimage screen, and 
quantified using ImagineQuant software (Molecular dynamics Inc. Sunnyvale, CA). 
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To reuse the membrane, the probes were stripped off by adding 100 ml 
boiling 0.1% SDS to the membrane and cooling gradually to room temperature. 

Filter Binding Assay: 

5 The templates for T7 transcription to produce individual MGMs and a 

control RNA were prepared using reciprocal primer extension with the T7FOR and 
COMP oligo pairs listed above (SEQ ID Nos: 19-26) during a single cycle PCR 
reaction. Each lOOpl primer extension reaction contains 40 pmole of each primer and 
yielded about l^g product. RNA was synthesized using the T7-MEGAshortScript™ 

10 in vitro transcription kit according to the manufacturer's instructions. The [a- P] 
UTP incorporated RNA was prepared using the MAXIscript™ in vitro transcription 
kit according to the manufacturer's instructions. The labeled RNA was used as a 
tracer and mixed with the cold RNA at the ratio of 1 :400-l :600. Different amounts of 
RNA were normalized into 20jal of lx Binding Buffer (50 mM Tris-HCl, pH 7.6, 0.5 

15 mM MgCb, 150 mM salt — KG, NaCl, or LiCl as indicated) to final concentrations in 
the range of 10-1000 nM. Each 20jxl RNA preparation also contained 1 jag of yeast 
tRNA as a non-specific competitor. The RNA preparations were incubated at 80° C for 
5 min and then cooled down to the ambient temperature and kept there for 30 min 
before loading to the filter. 

20 The setting of the filter-binding assay was modified from that of Wong 

and Lohman ("A Double-filter Method for Nitrocellulose-filter Binding: Application 
to Protein-nucleic Acid Interactions," Proc. Natl Acad. Sci. USA 90:5428-5432 
(1993), which is hereby incorporated by reference in its entirety). The BA-S85 NC 
membrane and NA45 DEAE membrane were purchased from Schleicher & Schuell 

25 Inc. (Keene, NH) and the GeneScreen Plus™ hybridization transfer membrane was 
obtained from NEN Life Science Products (Boston, MA). The membranes were first 
equilibrated with the proper lx Binding Buffer. A 24-well slot-blot apparatus was 
assembled with the positively charged DEAE or hybridization membrane at the 
bottom and the NC membrane on the top. The vacuum was adjusted such that the flow 

30 rate was about lOOjil/min. 4-6 samples were applied to the wells each time. For each 
sample, the well was first flushed with 100|il of lx Binding Buffer, followed 
immediately by the application of the RNA sample, and washed with another IOOjlxI of 
lx Binding Buffer immediately thereafter. With the vacuum on, the apparatus was 



-26- 



disassembled and the membranes were air-dried before exposure to a phosphoimage 
screen. 

The signals of each sample on the NC and the positively charged 
membrane were quantified using the ImageQuant software. The fraction of RNA 
5 retained on the NC membrane (using the sum of signals on both membranes as total) 
was calculated and plotted. 

Partitioning by Electrophoretic Mobility Shift Assay: 

A trace amount of radioactive RNA from the same RNA pool was 
10 included in binding reactions. The binding reaction mixtures were set at 4°C for 5-10 
minutes before being loaded onto a 2.5% agarose gel in 1/4 TBE buffer, which was 
run at 4°C, The protein-bound RNA contained in the shifted band was visualized by 
autoradiography, retrieved, extracted, and amplified. 

1 5 Example 1 - MGM Restriction on the B52 Selected G9 Pool 

To utilize the RNaseH activity for the purpose of selectively 
eliminating an RNA sequence family, several issues needed to be addressed before 
the procedure could be applied to a heterogeneous sequence pool. The marking 

20 oligonucleotide has a minimal required length for it to anneal to the target sequence 
and be recognized by the enzyme. This length may exceed the length of the 
consecutive consensus sequence of the aptamer to be eliminated. In order not to affect 
any other sequence in the pool, the fidelity of hybridization must be ensured. Longer 
oligonucleotides carry more information and thus are more specific if the reaction is 

25 carried out at or near its melting temperature. Therefore an RNaseH that is stable over 
a wide range of temperature is desirable. These conditions were tested on three single 
representative MGM sequences, with a random sequence as control, before extending 
this method to the sequence pools. Marking oligonucleotides from 6 to 21 nucleotides 
in length were tested with the E. coli RNaseH at 37°C or a thermostable RNaseH 

30 (Hybridase) at 45°C. E. coli RNaseH is more efficient but less specific than the 

Hybridase. A marking oligonucleotide at least 8-nt in length is needed in a reaction 
with E. coli RNaseH at 37°C. For the Hybridase at 45°C the oligonucleotide should 
be 12-nt or longer. 
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Since different sequences may give rise to common three-dimensional 
structures by positioning key functional groups in similar spatial orientations, the 
related sequences in an aptamer family may display different degrees of homology. 
Nonetheless, the number and identity of nucleotides in regions that are functionally 
5 important usually show high degree of homology. The G-triplets in a MGM may fall 
into this category. Variations, including deletions and insertions, usually occur in 
regions serving topological roles. Presumably this is the case of the spacers between 
the G-triplets. Multiple sequence alignment of MGM family revealed two challenges 
to test the limit of the method being developed. First, because the spacer between the 

10 G-triplets varies in both nucleotide number and identity, the consecutive consensus of 
the family is no more than 3-4 nucleotides in length. A degenerate population of 
oligonucleotides with a single G-triplet in the middle would contain little information 
to specify a family member; therefore, longer oligonucleotides, longer than or equal to 
8 nucleotides, need to be designed in the attempt to cover the entire family or its 

1 5 majority. Second, poor sequence similarity among the family members necessitated a 
set of representative marking oligonucleotides, rather than one oligonucleotide only. 
This set would hybridize to overlapping subgroups of the family, which would result 
in high degree of imperfect hybridization during the "earmarking" step for the 
RNaseH treatment, thus decreasing its efficiency. 

20 Based on their frequency in the MGM family, a group of five octamers 

(SEQ ID Nos: 5-9) was chosen, each covering two adjacent G-triplets separated by 
one or two other nucleotides (the average spacer is three nucleotides in length), in the 
hope that this imperfectly representative set may cover the majority of the family. 
These were used in treatment by the E. coli RNaseH. For the Hybridase, a set of 

25 dodecamers was prepared by appending two degenerate positions on each end (SEQ 
ID Nos: 10-14). Repeated annealing and RNaseH treatments were performed to 
address the problem of imperfect hybridization. 

With the "octamarker set" (SEQ ID Nos: 5-9) or the "dodecamarker 
set" (SEQ ID Nos: 10-14), RNA restriction treatments with RNaseH were performed 

30 on the unselected GO pool and the selected G9 pool using the optimal conditions 
identified with single MGM clones. Each marker set contains equal molar amount of 
the five oligonucleotides. A significant decrease in population size was observed in 
the MGM-dominated G9 but not GO (Figure 3B). This result is confirmed by 
comparing the amount of amplified RNA after the treatment by quantitative RT-PCR. 
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Once treated, the selected pool became much more resistant to a second treatment. 
Family- specific probing revealed a decrease in the relative abundance of MGMs and a 
correlated increase in the relative abundance of BBSs after the treatments (Figures 
3 A-B). After two consecutive restriction treatments, pools G9-M/M(E) and G9- 
5 M/M(H) were cloned. (The nomenclature indicates that they were treated twice (- 
M/M) to eliminate the MGM family by either £.coli RNaseH or the .tfybridase, and 
the relationship between all pools derived from the B52 G9 is depicted in Figures 2A- 
B) In both pools, the sequences were categorized in the following five groups: the 
BBS family, the MGM family, the C-rich family (apparently being indirectly selected 

10 by the G-rich MGM), short random regions (possibly resulting from deletion during 
repeated amplification or cloning), and sequences not belonging to the above families. 
The ratio of BBS to MGM is dramatically increased in both pools compared to the 
untreated G9 pool, as indicated in Figures 3 and 4. None of the surviving MGM 
isolates contained any consecutive sequence segment that would be perfectly matched 

15 by any marking oligonucleotide used, indicating an effective elimination of the 
marked species. The relative proportions of the four non-MGM groups are nearly 
identical in both treated pools. However, there are more MGM in the E. coli RNaseH 
treated pool than in the one treated with the Hybridase, indicating that treatment with 
longer marking oligonucleotides at higher temperature by the Hybridase is preferable. 

20 Moreover, a new representative consensus can be extracted from the surviving 

species, in which two adjacent G-triplets are separated by three A or T positions, A 
new marking oligonucleotide, NCW13 (SEQ ID No: 15), was designed and used in 
the treatment of another pool selected by a different target, and this improved the 
efficiency of MGM ablation (see Example 3 infra). 

25 

Example 2 - Restriction of MGM, BBS, or Both, on the B52 selected G9 Pool 

As described herein above, in addition to mining less prevalent RNA 
aptamer families from a selected pool, this population restriction method can be 
30 extended to a general approach for generating different ligands to all of the targets in 
a mixture. To test the efficiency and versatility of this method, both the MGM family 
and the BBS family were removed, successively or simultaneously, from the B52 
selected G9 pool and its derivatives. The relationship among the pools derived from 
B52 G9 is shown in Figure 2. In contrast to the MGM family, the BBS family has a 
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highly homologous, long consensus, and can be marked by two 1 7-mer sequences 
with only three positions different between them. The reaction was carried on at a 
higher temperature, 55°C, when only BBSs were to be eliminated, further increasing 
the specificity. Depletion of either or both families was confirmed by Southern dot 
5 blot analyses with the marking oligonucleotides as probes, as shown in Figure 3. 

Example 3 - MGM Restriction on the HSF Selected G9 Pools and Subsequent 
Selection of an HSF Aptamer 

10 To confirm the efficiency of the method developed in previous 

examples, and to provide a more general case of exhaustive selection and matrix- 
binding reduction, two other selections were performed against the Drosophila heat 
shock factor (HSF) (Wu, "Heat Shock Transcription Factors: Structure and 
Regulation", ylwz. Rev. Cell Dev. Biol. 11:441-469 (1995), which is hereby 

1 5 incorporated by reference in its entirety), which regulates the activation of heat shock 
gene transcription by binding to specific promoter elements. 

The first stage of the selection used as the starting material the same 
initial unselected RNA pool as that used in the B52 selection, The target HSF was 
expressed and purified as a GST fusion protein in E. coli. The selection scheme and 

20 conditions were almost identical to that used for B52 except the following minor 
modifications. In one selection (designated L), a negative selection against 
nitrocellulose filter following the recovery of bound RNA was performed in Cycle 8. 
In the other selection (designated H), a higher concentration (50mM) of magnesium 
ion was used in the binding buffer. Based on the experience with the B52 selection, 

25 this stage of selection was expected to yield pools dominated by MGMs. Indeed, 

when the pools from the seventh and the ninth generations of Selection L were cloned 
and sequenced, all but one sequence were MGM's. A Southern (dot) blot analysis on 
the genealogical archives (saved fractions of pools from each cycle of selection) of 
both experiments with a representative MGM probe showed an effective enrichment 

30 of the MGM family over the generations (see Example 4 infra). The contours of the 
evolutionary trajectories are almost identical with that in the B52 selection, but 
apparently the MGM family was enriched even more efficiently in Selection L. 

Three consecutive RNA restriction treatments were performed on the 
two G9 pools to eliminate the MGM. The first two treatments were done with the 

35 same protocol and reagents that produced B52 G9-M/M, namely, with the 
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"dodecamarker set" (SEQ ID Nos: 10-14) and the Hybridase at 45°C. After examining 
the sequences of the two B52 G9-M/M pools, a group of surviving MGM was 
identified and a new marking oligo, NCW13 (SEQ ID No: 15), was designed. The 
NCW13 was used together with the dodecamarkers in the third treatment of the HSF 
5 G9 pool to yield the two G9-M/M/M pools that were cloned. The relationship 

between pools derived from the two HSF G9 pools is depicted in Figure 2. Twenty- 
two individuals from the G9/L-M/M/M pool and twenty-seven from the G9/H- 
M/M/M pool were sequenced. Compared to the B52 G9-M/M pools, MGMs have 
been further decreased due to the use of NCW13. Among the remaining isolates about 

10 half are C-rich sequences, as was seen in the B52 G9-M/M pools. 

Considering the sequence data of these treated pools, it was speculated 
that a very small fraction of true HSF aptamers may still exist in the final generation 
(G9) dominated by MGM family. After the MGM restriction treatment, a second 
stage of selection was performed using the G9/H-M/M/M pool as starting material 

1 5 (designated Generation 9a in Figure 5A). During this stage, a His-tagged HSF is used 
as the target; and the magnesium concentration is kept at 5mM under the condition 
otherwise identical to that used previously. Three cycles of selection with filter- 
binding as partitioning method were performed to yield Generations 10, 11, and 12. A 
negative selection step against nitrocellulose filter following the recovery of bound 

20 RNA was included in these cycles as in Cycle 8 of Selection L. Two more cycles 
using electrophoretic mobility shift as partitioning method were performed to yield 
Generations 13 and 14. When G14 was cloned and sequenced, five individuals in a 
sample of 19 have an identical sequence that was subsequently verified to have 
contained a high-affinity aptamer of HSF (RA1-HSF). 

25 

Example 4 - Evolutionary Trajectory of the MGM Family in Three 
Independent Selections 

In the above examples, the population restriction method was used to 
30 mine less prevalent RNA aptamer families from a selected pool. However, the method 
can also be used to control the population of an aptamer family, such as MGM, during 
the selection and before it dominates the pool. The population dynamics of this family 
in the selection process in the absence of restriction would reveal a proper time point 
(or generation) for the intervention to begin in order to prevent MGM domination in 
35 the selected pools. To this end, the evolutionary trajectories of the MGM family were 
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mapped in all three selection experiments mentioned above. Representative sequences 
of the MGM family were used as probes to detect the family members and reveal their 
relative concentration in the DNA pools in a Southern (dot) blot analysis. Since the 
spacing between G-triplets varies considerably it is not possible to find a sequence or 
5 a small set of sequences long enough to cover all MGM clones. Two short sequences, 
each containing two G-triplets separated by only one or two nucleotides, were 
extracted and two more degenerated flanking positions were added to construct the 
representative probes for the MGM family (SEQ ID Nos: 10 and 1 1). In all of the blot 
analyses, the 5' constant region on each sequence was probed (using SEQ ID No: 18) 

10 to normalize the signal. As shown in Figures 6, the time series of relative abundance 
as probed by these probes revealed a steady enrichment trajectory. They did not 
become significantly abundant until after Cycle 3. Towards the end, the rate of 
enrichment slowed down, as there was less selective pressure between the winning 
clones and the bulk pool. If the RNA restriction method is applied at Cycle 3 and 

1 5 thereafter, the MGM family would not have dominated the final selected pools. 

Example 5 - Ion Dependence of the MGM Affinity to the Filter 

A possible structure of MGM is suggested by the presence of 4 G- 
20 triplets. It has been reported that poly rG can form four-stranded helices (G- 

quadruplexes) (Zimmerman, et al., "X-ray Fiber Diffraction and Model-building 
Study of Polyguanylic Acid and Polyinosinic Acid,' V. MoL Biol. 92:181-192 (1975), 
which is hereby incorporated by reference in its entirety). RNAs containing short runs 
of Gs can also tetramerize (Kim et al., " Tetramerization of an RNA Oligonucleotide 
25 Containing a GGGG Sequence," Nature 351:33-332 (1991), which is hereby 

incorporated by reference in its entirety). The quartets of G residues on which such 
structures depend consist of four guanine bases in a planar array arranged in a cyclic 
hydrogen-bonding pattern, where each guanine is both the donor and acceptor of two 
hydrogen bonds between the Hoogsteen and Watson-Crick faces. This arrangement 
30 generates a pocket at the center lined by electronegative carbonyl oxygens that is 

thought to be the site of interaction with a cation. Although such structure is stable in 
solution, it exhibits a strong dependence of the melting temperature on the 
monovalent ion present (Williamson, "G-quartet Structures in Telomeric DNA," Ann. 
Rev. Biophys. Biomol Struct. 23:703-730 (1994), which is hereby incorporated by 
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reference in its entirety). The original binding buffer used in the selection experiment 
contains 150mM potassium acetate, which reflects the intracellular concentration of 
potassium. In a binding reaction, K + was displaced with identical concentrations of 
Na + and Li + . As a result, a dramatic decrease of filter binding activity was observed, 
5 almost to the level of the unselected pool (Figure 7). This result strongly suggests that 
the MGM sequences form G-quartets or similar structures that are active in binding to 
the filter, and this feature of MGM can also be explored in the process of selection to 
decrease the abundance of this family in the pools. 

10 Example 6 - Theoretical Foundation for Observed Results 

To describe the process of in vitro evolution, the approach of Eigen 
can be adopted and extended (Eigen, "New Concepts for Dealing with the Evolution 
of Nucleic Acids," Cold Spring Harb. Symp. Quant Biol 52:307-320 (1987), which 
1 5 is hereby incorporated by reference in its entirety). Based on the exponential model 
for single species growth and decay 

where k is growth constant and c is concentration at time t, the clone i (descendants of 
a single ancestor individual in the original unselected pool) can be followed in terms 
20 of relative concentration of its members 




and its relative growth constant 

K rel = k-Kt) 

which is its own growth constant (fa) relative in comparison with the average growth 
25 constant 

The growth constant k t of a clone i is a function of its member's 
fitness. The more fit they are to perform the task selected for, the more possible the 
individuals in the clone will be rewarded to reproduce and, thus, the faster the clone 
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will grow. Because the total number of individuals is limited and fixed in each round 
of selection to ensure competition, the average growth constant is a function of time 
and will increase over the cause of the experiment. In each round of selection and 
amplification, the life cycle of individuals is synchronized; thus, time becomes 
5 discrete, analogous to computational updates in parallel processing. For a generic 
clone in the pool, its growth or decay is described as 

*,={*, -*(')}*, (0 
10 h> k(t), x i > 0; kj < k(t), xj < 0. 

Eventually, the average growth constant will aproach the maximal 
growth constant k m , which is the that of the clone with highest fitness, and this clone 
(m) will expand to a point where its relative concentration is close to 1 . A small 
15 sampling at this point will yield multiple isolates of indiviudals belonging to this 
clone, sometimes along with a few other individuals with growth constant, i.e., 
fitness, ranking next to it. 

If the fitness rank can be altered after the isolation of this first aptamer 
clone, i.e., if the growth constant of this isolated clone can be decreased to a reduced 
20 value k rm , so that 

0 ^ krm < km 

then other aptamer clones with lower previously defined fitness can be identified. In 
particular, when k rm < k m _j, and x w ./>0, the clone m-1 that ranked next to m will be the 
sure winner of the next stage of selection, if it has not yet been isolated together with 

25 m. In general, aptamer clone m-n can be isolated by iteration of this process, until the 
(n+ 1 )* aptamer shows no specific binding to any known target in the system or its 
growth rate is indistinguishable from the remaining non-specific binding individuals. 
Theoretically, this scheme should be exhaustive, as every individual in the original 
pool with affinity to any target will eventually have a chance to be isolated, one by 

30 one, according to their rank of "growth rate." 

As the first implementation of this scheme to demonstrate the effect of 
fitness change in different stages of in vitro evolution, a system with more than one 
target was sought in which aptamer clones for different targets are expected to f, grow" 
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at significantly different rates under well-defined selective conditions. A minimal 
case should have two targets, either separable or inseparable. (From an operational 
point of view, separable targets presented in a fixed ratio of concentration are 
tantamount to a set of inseparable targets.) During the first stage of the selection 
5 procedure, selection should yield aptamers for one target; but the isolation of 
aptamers for the other target in a second stage of the selection procedure would 
require a fitness change (that reduces the growth constant) for the aptamers previously 
isolated. The large difference of growth rate in the first stage would contribute to the 
clarity of the demonstration; and two separable targets would lend analytical ease to 
1 0 this system. 

To fulfill these requirements, in Example 3 aptamers for a protein 
target, the Drosophila heat shock factor (HSF), were selected for identification, and 
aptamers for nitrocellulose filters used in the partitioning step, were selected in 
different stages of in vitro evolution, all starting from a single ancestral pool of RNA. 

1 5 The heat shock factors regulate the activation of heat shock gene transcription by 
binding to specific promoter elements. Upon heat treatments, HSF forms 
homotrimers that bind to specific DNA sequences called heat shock elements (HSEs). 
Other functions of HSF are not fully understood. Drosophila HSF contains at least 
three domains: a DNA-binding domain, a trimerization domain, and an acidic 

20 activation domain. It does not contain any known RNA recognition sequence motifs 
and is not known to bind to RNA in nature. The filters are chosen as one of the 
targets, not because of the utility of their aptamers, but because in Example 1 they 
yielded a fortuitous family of aptamers, named multi-G motifs (MGMs), whose 
evolutionary trajectories were characterized. Base on these previous results, this 

25 aptamer family can be expected to grow at a fast rate under known conditions. The 
population control procedure was also tested to be effective on this family in Example 
2. 

The first stage of the selection used as the starting material the same 
initial unselected RNA pool that previously yielded the MGM family. The target HSF 
30 was expressed and purified as a GST fusion protein in E. coli. The selection scheme 
and conditions were almost identical to that described above except that higher 
magnesium concentration was used to help RNA molecules fold. A Southern dot blot 
analysis on the genealogical archives (saved fractions of pools from each cycle of 
selection) with a representative MGM probe showed an effective enrichment of the 
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MGM family through the first to the nine generations (see Figure 5 A), with the 
contour of its evolutionary trajectory almost identical to that in the selection involving 
another B52 (see Figure 6). Sequencing of a single individual isolated from the ninth 
generation yielded a typical MGM. 
5 Usually a small collection of isolates (10-100 individuals) is sequenced 

at the end of an experiment to identify selected aptamers. Starting from a pool of 10 13 
candidates, this would require a more than 10 1 1 fold enrichment to see multiple 
isolates of a clone. And Clone m-1 should have a growth rate close to that of Clone m 
to be isolated in this sampling. If a lot more individuals in the MGM-dominated G9 

1 0 were sequenced, then it might have been possible to see aptamers for HSF. But for 
the purpose of the current study, a second stage of evolution was performed to allow 
HSF aptamers to dominate the selected pools in an environment where the growth rate 
of MGMs is reduced asymptotically to zero. This stage can use as starting material 
any pool in which the relative concentration of any HSF aptamer does not equal zero. 

1 5 The last generation, G9, was selected because the procedure can appropriately be 
expected to succeed with any other pools from GO to G8 if success can be achieved 
with G9, which is the least complex. 

Three consecutive RNA restriction treatments were performed on the 
G9 pools to eliminate the MGM and generate the pool designated G9a (see Figures 

20 5 A-5B). The first two treatments were done with the "dodecamarker" set and the last 
included an additional marking oligo, NCW1 3. This treatment can also be used in the 
subsequent rounds of selection to reduce the growth rate of MGM. However, a 
negative selection (collecting candidates not binding to the filter) is more convenient 
when the targets are separable. Three cycles of selection with filter-binding as 

25 partitioning method were performed with a His-tagged HSF construct to yield 

Generations 10, 11, and 12. Each of these cycles included a negative selection step 
against nitrocellulose filter following the recovery of bound RNA. Two more cycles 
using electrophoretic mobility shift as partitioning method with His-HSF were 
performed to yield Generations 13 and 14. When G14 was cloned and sequenced, 

30 two HSF aptamers dominated the pool, one of them (named RA1-HSF) constituted 
more than 25% of the sample and showed high affinity to HSF. No MGM showed up 
in this sample. 

The process of this two-stage evolution experiment can be 
characterized by following the enrichment of MGMs and RA1-HSF through the 
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generations using a Southern (dot) blot analysis on the genealogical archives. The 
distinct profiles of these two classes of aptamers are shown in Figures 5A-5B. To 
verify these profiles, the G5 and the G9a pools were also cloned and sequenced, in 
addition to G9 and G14, and present a summary of the sequencing results in Table 1 
5 below. 



Table 1 : Summary of Sequencing Aptamers from G5, G9, G9a, and G14 Generations 





G5 


G9 


G9a 


G14 


MGMs 


10 


1 


1 


0 


C-rich sequences 


7 


0 


12 


3 


HSF aptamers 


0 


0 


3 


8 


orphan sequences 


13 


0 


10 


9 


Total 


30 


1 


26 


20 


Multiple isolates 


2x MGM#3 


0 


3x #9a-l 


6xRAl-HSF 










2x #14-2 



10 According to the model, clones with kg greater than the average growth 

constant of the initial pool but lower than k m would grow (be enriched) during the 
course of the experiment to a point where k ( is equal to the average growth constant. 
Then they will decay and eventually become extinct. Sequencing of earlier pools 
allowed capture of sequences for clones becoming extinct later and tracing of their 

1 5 abundance over generations. The dot blot analysis was used to detect a range of 
relative concentration xt of a sequence from 1 to 10" 3 , thus even a ten-billion-fold 
enrichment from 10~ 13 to 10" 3 will not be revealed. Nevertheless, a collection of 
representative clones was obtained and their growth and decay was followed during 
the second stage of the experiment to provide a well-rounded survey of the process. 

20 The sequences of the entire randomized sequence, oligonucleotides 40 bases in length 
in most cases, were used as probes. These may not be clone-specific, and could pick 
up signals from a family with a common consensus fragment shorter than the 
randomized region. In general, members of such a sequence family have similar 
functional traits. 

25 RA1-HSF is the most abundant clone (6 individuals out of 20 isolates) 

in the pool G14 (compare graphs of Figure 8 A). It showed the highest affinity among 
isolated clones in an electrophoretic mobility shift (EMS) assay to HSF. It was not 
present in the sample from the other three pools. Its enrichment profile showed 
dramatic growth toward the end of the second stage without sign of slowing rate 
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(Figure 8 A). Clone 14-2 was isolated twice in the G14 sample, and showed a weaker 
affinity to HSF in an EMS assay (Figure 8A). It showed a moderate enrichment 
during the early cycles of the second stage, then decayed towards the end. A similar 
profile was observed for another clone with weak affinity to HSF, Clone 9a- 1 (Figure 
5 8 A). This clone was isolated 3 times in a sample of 26 individuals from G9a, but not 
present in the sample from G14. One other clone was randomly picked from each of 
G5 (clone 5-23), G9a (clone 9a-21), and G14 (14-13). Each of them isolated only 
once. None of them showed detectable affinity to HSF. Their signal in the dot blot 
analysis cannot be distinguished from the background (Figure 8B). As shown in 

1 0 Figures 8A-B, these results demonstrate a correlation between the aptamers' affinity to 
HSF and their growth rate during the second stage, providing experimental support to 
the connection between fitness and growth during in vitro evolution. 

As shown in Figure 9 A, the affinity of RA1-HSF to HSF was tested 
with several different HSF preparations, including GST-HSF, a His-HSF, a MBP- 

15 HSF, all purified from bacteria, and baculovirus expressed untagged HSF in crude 

extract of Sf-9 cells. RA1-HSF can also bind to truncated HSF constructs without the 
activation domain, suggesting that it probably binds to the DNA binding domain. 
Estimated from the EMS data, it has a Kd of 20nM. In contrast, monomelic HSF has 
very weak affinity to DNA with Kd in the range of jaM. 

20 The RA1 -HSF clone was further characterized by deletion analysis to 

define the true aptamer moiety (Figure 9B). Deleting the 3' constant region and 5 
nucleotides next to it did not compromise the affinity of the remaining sequence. 
However, deletion of the 5 f constant region abolished binding capability, indicating 
that both 5 f constant region and the randomized region are parts of the true aptamer. 

25 Without changing the topography of fitness landscape, the 

conventional in vitro selection and amplification methodology constitutes a single act. 
From the conventional perspective, the first stage in the above examples is a 
successful experiment for the isolation of filter-binding aptamers but a failed one with 
regard to HSF, and second stage was a success for HSF but a failure for the filter- 

30 binding aptamers. However, in an experiment with complex targets, the aim is to 
isolate aptamers for all targets. In the two-target case presented in the above 
examples, this goal was fulfilled by setting different fitness landscapes for different 
stages. Throughout the first stage and most of the second stage, the two targets were 
present in a fixed ratio as if they were inseparable from each other. If they were 
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indeed inseparable, it would not be realistic in the second stage to use 
positive/negative toggle selection and EMS as a partitioning device. But negative 
selection against the genotype (sequence) of MGMs tested previously can instead be 
used. The efficiency of this method is demonstrated here again by the difference in 
5 composition between pools G9 and G9a (see Figures 5A-5B). Therefore, the 

procedure presented here can be adopted for experiments with complex inseparable 
targets with little modification. 

In a multi-stage selection scheme, the most fit aptamer clone or clones 
in one stage are converted to the least fit one(s) in the next stage, thus allowing clones 

10 dominate the selected pools in successive stages in an order according to their original 
rank of fitness in first stage. While the "two-stages-for-two-targets" experiment 
presented in the above examples is a minimal case, a corresponding number of 
selection stages for additional targets can be implemented easily according to this 
same model. Experimentally, negative selection according to genotypes of candidate 

1 5 clones can be executed in an effectively parallel manner. In the above examples, the 
second stage was started with a pool of low complexity, taking considerable risk of 
drift caused by both random processes and the MGM restriction treatment. In later 
stages, though, it would be safer to start with a more complex pool, even the original 
unselected pool. 

20 RNA aptamers isolated from randomized sequence pools are often 

embedded in different sequence contexts and possess some variations, constituting an 
artificial phylogeny. While some "point mutations" were found in different 
individuals of the RA1-HSF clone, no clones sharing a consensus sequence with it in 
the randomized region were isolated. The low complexity of the starting pool of the 

25 second stage may be a factor contributing to this result. The deletion analysis (Figure 
9B) also suggests that the RA1-HSF aptamer is complex with high information 
content, and therefore occurs rarely in the unselected pools. With more information 
about the aptamer, it is possible to address this issue by a positive selection according 
to genotype. The sequence in the randomized region required for binding to HSF can 

30 be used as a probe to isolate containing this segment but different from RA1 -HSF. 

The HSF plays a central role in cellular stress response. It senses and 
integrates signals from extracellular and intracellular environments and regulates the 
expression of heat shock genes whose products are chaperones that help proteins 
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maintain stable folded conformations. Aptamers for HSF can be used as effective 
molecular probes to study and control this important and complex process. 

Although the invention has been described in detail for the purpose of 
5 illustration, it is understood that such detail is solely for that purpose, and variations 
can be made therein by those skilled in the art without departing from the spirit and 
scope of the invention which is defined by the following claims. 



